Smoothed Analysis of Belief Propagation for 
Minimum-Cost Flow and Matchingg* 



Tobias Brunsch 



Kamiel Cornelissen 2 
Heiko Roglin 1 



Bodo Manthey 2 



University of Bonn 
Department of Computer Science 
brunschOcs . uni-bonn. de , 
heikoOroeglin . org 



University of Twente 
Department of Applied Mathematics 
b . mantheyOutwente . nl , 
k . cornelissenOutwente . nl 



Belief propagation (BP) is a message-passing heuristic for statistical inference 
in graphical models such as Bayesian networks and Markov random fields. BP is 
used to compute marginal distributions or maximum likelihood assignments and 
has applications in many areas, including machine learning, image processing, and 
computer vision. However, the theoretical understanding of the performance of BP 
is unsatisfactory. 

Recently, BP has been applied to combinatorial optimization problems. It has 
been proved that BP can be used to compute maximum-weight matchings and 
minimum-cost flows for instances with a unique optimum. The number of iterations 
needed for this is pseudo-polynomial and hence BP is not efficient in general. 

We study belief propagation in the framework of smoothed analysis and prove 
that with high probability the number of iterations needed to compute maximum- 
weight matchings and minimum-cost flows is bounded by a polynomial if the 
weights/costs of the edges are randomly perturbed. To prove our upper bounds, 
we use an isolation lemma by Beier and Vocking (SIAM J. Comput. 2006) for 
matching and generalize an isolation lemma for min-cost flow by Gamarnik, Shah, 
and Wei (Operations Research, 2012). We also prove almost matching lower tail 
bounds for the number of iterations that BP needs to converge. 

1 Belief Propagation 

The belief propagation (BP) algorithm is a message-passing algorithm that is used for solving 
probabilistic inference problems on graphical models. It has been introduced by Pearl in 
1988 [8j. Typical graphical models to which BP is applied are Bayesian networks and Markov 
random fields. There are two variants of BP. The sum-product variant is used to compute 
marginal probabilities. The max-product or min-sum variant is used to compute maximum a 
posteriori (MAP) probability estimates. 

"This research was supported by ERC Starting Grant 306465 (BeyondWorstCase) and NWO grant 613.001.023 
(Smoothed Analysis of Belief Propagation) . 
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Recently, BP has experienced great popularity. It has been applied in a large number of 
fields, such as machine learning, image processing, computer vision, and statistics. For an 
introduction to BP and several applications, we refer to Yedidia et al. |17] . The are basically 
two main reasons for the popularity of BP. First of all, it is generally applicable and easy to 
implement because of its simple and iterative message-passing nature. In addition, it performs 
well in practice in numerous applications [15|116|. 

If the graphical model is tree-structured, BP computes exact marginals/MAP estimates. 
In case the graphical model contains cycles, convergence and correctness of BP have been 
shown only for specific classes of graphical models. To improve the general understanding 
of BP and to gain new insights about it, the performance of BP as either a heuristic or an 
exact algorithm for several combinatorial optimization problems has been studied. Amongst 
others it has been applied to the maximum-weight matching (MWM) problem, the minimum 
spanning tree (MST) problem, the minimum-cost flow (MCF) problem, and the maximum- 
weight independent set problem }12j. 

Bayati et al. [1] have shown that max-product BP correctly computes the MWM in bipartite 
graphs in pseudo-polynomial time if it is unique. For MST, it is known that BP converges to the 
correct optimal solution, if it converges at all (which is not guaranteed) [3J. Gamarnik et al. [B] 
have shown that the max-product BP algorithm computes the MCF in pseudo-polynomial time 
if it is unique. 

1.1 Belief Propagation for Matching and Flow Problems 

In this section we discuss the previous results about the BP algorithm for computing maximum- 
weight matchings and minimum-cost flows in more detail. Bayati et al. [4] have shown that 
the max-product BP algorithm correctly computes the maximum-weight matching in bipartite 
graphs if it is unique. Convergence of BP takes pseudo-polynomial time and depends linearly 
on the weight of the heaviest edge and on 1/5, where 5 is the difference in weight between 
the best and second-best matching. In Section \2. 31 we describe the BP algorithm for MWM in 
detail. 

Belief propagation has also been applied to finding maximum-weight perfect matchings in 
arbitrary graphs and to finding maximum- weight perfect 6-matchings [2j[IT|, where a perfect 
6-matching is a set of edges such that every vertex is incident to exactly b edges in the set. 
For arbitrary graphs the BP algorithm for MWM does not necessarily converge However, 
Bayati et al. [2J and Sanghavi et al. [11] have shown that the BP algorithm converges to the 
optimal matching if the relaxation of the corresponding linear program has an optimal solution 
that is unique and integer. The number of iterations needed until convergence depends again 
linearly on the reciprocal of the parameter 5. Bayati et al. [2] have also shown that the same 
result holds for the problem of finding maximum-weight 6-matchings that do not need to be 
perfect. 

It turns out that BP can, to some extent, solve the relaxation of the corresponding linear 
program for matching, even if it has a non-integral optimal solution. Bayati et al. [2j have 
shown that it is possible to solve the LP relaxation by considering so-called graph covers, in 
which they compute a bipartite matching. In case of an optimum that is unique and integer, 
the optimal solution in the graph cover corresponds to the optimal solution. In case of a unique 
but fractional optimal solution, the average of the estimates of two consecutive iterations (both 
of which are perfect matchings in the graph cover) yield a value of 0, 1/2, or 1 for any edge, 
which then equals its value in the optimal solution of the relaxed LP. 
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Sanghavi et al. have shown that BP remains uninformative for some edges (and outputs 
"?" for those), but computes the correct values for all edges that have a fixed integral value 
in all optimal solutions. 

Gamarnik et al. [6] have shown that BP can be used to find a minimum-cost flow, provided 
that the instance has a unique optimal solution. The number of iterations until convergence 
is pseudo-polynomial and depends again linearly on the reciprocal of the difference in cost 
between the best and second-best integer flow. In addition, they have proved a discrete isolation 
lemma [6j Theorem 8.1] that shows that the edge costs can be slightly randomly perturbed to 
ensure that, with probability at least 1/2, the perturbed MCF instance has a unique optimal 
solution. Using this result, they have constructed an FPRAS for MCF using BP. 

1.2 Smoothed Analysis 

Smoothed analysis has been introduced by Spielman and Teng [13] in order to explain the 
performance of the simplex method for linear programming. It is a hybrid of worst-case and 
average-case analysis and an alternative to both: An adversary specifies an instance, and this 
instance is then slightly randomly perturbed. The perturbation can, for instance, model noise 
from measurement. Since its invention in 2001, smoothed analysis has been applied in a variety 
of contexts. We refer to two recent surveys [ZHH] for a broader picture. 

We apply smoothed analysis to belief propagation for min-cost flow and maximum-weight 
matching. To do this, we consider the following general probabilistic model. 

• The adversary specifies the graph G = (V, E) and, in case of min-cost flow, the integer 
capacities of the edges and the integer budgets (both are not required to be polynomially 
bounded). Additionally the adversary specifies a probability density function f e : [0, 1] — >■ 
[0, 4>] for every edge e. 

• The costs (for min-cost flow) or weights (for matching) of the edges are then drawn 
independently according to their respective density function. 

The parameter </> controls the adversary's power: If <fr = I, then we have the average case. The 
larger </>, the more powerful the adversary. The role of 4> is the same as the role of I/a in the 
classical model of smoothed analysis, where instances are perturbed by independent Gaussian 
noise with standard deviation a. In that model the maximum density <p is proportional to I/a. 

1.3 Our Results 

We prove upper and lower tail bounds for the number of iterations that BP needs to solve 
maximum-weight matching problems and min-cost flow problems. Our bounds match up to a 
small polynomial factor. 

We prove that the probability that BP needs more than t iterations is bounded by 0(n 2 m(j)/t) 
for the min-cost flow problem and 0[nm(j)/t) for various matching problems, where n and m are 
the number of nodes and edges of the input graph, respectively (Sections [3] and H|. The upper 
bound for matching problems holds for the variants of BP for the maximum-weight matching 
problem in bipartite graphs [3] as well as for the maximum-weight (perfect) 6-matching problem 
in general graphs [21 E]. For the latter it is required that the polytope corresponding to 
the relaxation of the matching LP is integral. If this is not the case, we can still solve the 
relaxation of the matching LP with a slightly modified BP algorithm [2] (using graph covers, 
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see Remark I4.3p . To prove the upper tail bound for BP for MCF we use a continuous isolation 
lemma that is similar to the discrete isolation lemma by Gamarnik et al. [6] Theorem 8.1]. We 
need the continuous version since we not only want to have that a unique optimal solution, 
but we also need to quantify the gap between the best and the second-best solution. 

These upper tail bounds are not strong enough to yield any bound on the expected number 
of iterations. Indeed we show that this expectation is not finite by providing a lower tail 
bound of Q(n(f)/t) for the probability that t iterations do not suffice to find a maximum- weight 
matching in bipartite graphs. This lower bound even holds in the average case, i.e., if (f) = 1 
(Section 15. 2p . and it carries over to the variants of BP for the min-cost flow problem and 
the minimum/maximum- weight (perfect) 6-matching problem in general graphs mentioned 
above (2jSl[6l[TT] . The lower bound matches the upper bound up to a factor of 0(m) for 
matching and up to a factor of 0(nm) for min-cost flow (Section 15. 3p . The smoothed lower 
bound even holds for complete (i.e., non-adversarial) bipartite graphs. 

Note that our lower bound on the number of iterations until convergence does not contradict 
the results by Salez and Shah [ID]. Roughly speaking, they have proved that BP for matching 
requires expected time 0(n 2 ). However, they allow that a small number of nodes are matched 
to incorrect nodes. It might even be the case that multiple nodes are matched to the same 
node. In our analysis we require convergence of the BP algorithm, i.e., each node should be 
matched to the unique node to which it is matched in the optimal matching. 

Finally, let us remark that, for the min-cost flow problem, we bound only the number of 
iterations that BP needs until convergence. The messages might be super-polynomially long. 
For all matching problems, however, the length of the messages is always bounded by a small 
polynomial. 

2 Definitions and Problem Statement 

In this section we define the maximum-weight matching problems that we consider and the 
min-cost flow problem. We also describe the BP algorithms that we apply. 

2.1 Maximum- Weight Matching and Minimum-Cost Flow 

First we define the maximum- weight matching problem on bipartite graphs. For this con- 
sider an undirected weighted bipartite graph G = (UUV,E) with U = {ui, . . . ,u n }, V = 
{v±, ... ,v n }, and E C {(u%,Vj) = e^-,1 < i,j < n}. Each edge has weight Wij E M + . A 
collection of edges M C E is called a matching if each node of G is incident to at most one 
edge in M. We define the weight of a matching M by 



A ^-matching M C E in an arbitrary graph G = (V, E) is a set of edges such that every node 
from V is incident to at most b edges from M. A 6-matching is called perfect if every node 
from V is incident to exactly b edges from M. Also for these problems we assume that each 
edge e € E has a certain weight w e and we define the weight of a 6-matching M accordingly. 




The maximum-weight matching M* of G is defined as 



M* = argmax{w(M) | M is a matching of G}. 
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2.2 Min-Cost Flow Problem 

In the min-cost flow problem (MCF) the goal is to find a cheapest flow that satisfies all capacity 
and budget constraints. We are given a graph G = (V, E) with V = {vi, . . . , v n }. In principle 
we allow multiple edges between a pair of nodes, but for ease of notation we consider simple 
directed graphs. Each node v has a budget b v G Z. Each directed edge e = e»j from Vj to Vj 
has capacity u e G No and cost c e G M + . For each node u G V, we define as the set of edges 
incident to v. For each edge e £ E„ we define A(v, e) = 1 if e is an out-going edge of v and 
A(v , e) = —1 if e is an in-going edge of v. In the MCF one needs to assign a flow f e to each 
edge e such that the total cost X^ee-B c e/e is minimized and the flow constraints 

< f e < u e for all e G E, 

and budget constraints 

A(u, e)/ e = for all u G V" 

are satisfied. We refer to Ahuja et al. [1] for more details about MCF. 

Let us remark that we could have allowed also rational values for the budgets and capacities. 
As our results do not depend on these values, they are not affected by scaling all capacities 
and budgets by the smallest common denominator. 

Note that finding a perfect minimum-weight matching in a bipartite graph G = (U U V, E) 
is a special case of the min-cost flow problem, see Ahuja et al. [1] for details. 

2.3 Belief Propagation 

For convenience, we describe the BP algorithm used by Bayati et al. [4j. For the details of the 
other versions of BP for the (perfect) maximum-weight 6-matching problem and the min-cost 
flow problem we refer to the original works OEKH] . When necessary, we discuss the differences 
between the different versions of BP in Sections [5] and [5j 

The BP algorithm used by Bayati et al. [3] is an iterative message-passing algorithm for 
computing maximum- weight matchings (MWM). Bayati et al. define their algorithm for com- 
plete bipartite graphs G = (UUV,E) with \U\ = \V\ = n. In each iteration t, each node Ui 
sends a message vector 

M\ 3 = [Al j (l),r4 j (2),...,rh t ij (n)} 

to each of its neighbors Vj. The messages can be interpreted as how 'likely' the sending node 
thinks it is that the receiving node should be matched to a particular node in the MWM. The 
greater the value of the message m^ (r), the more likely it is according to node Ui in iteration 
t that node Vj should be matched to node u r . Similarly, each node Vj sends a message vector 

Mjt to each of its neighbors Uj. The messages are initialized as 

if r = i, 
otherwise and 

if r = j, 
otherwise. 



m°(r) 



w 



w 
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The messages in iterations t > 1 are computed from the messages in the previous iteration 
as follows 
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if r = i, 

otherwise and 
if r = j, 

otherwise. 



The beliefs of nodes U{ and Vj in iteration t are defined as 

k 

b%{r)=w rj + ^m t kj (r). 



The beliefs can be interpreted as the 'likelihood' that a node should be matched to a particular 
neighbor. The greater the value of b l Ui {j), the more likely it is that node Ui should be matched 
to node Vj. We denote the estimated MWM in iteration t by M . The estimated matching 
M t matches each node Ui to node Vj, where j = argmax 1<r<n {6*.(r)}. Note that M* does not 
always define a matching, since multiple nodes may be matched to the same node. However, 
Bayati et al. [1] have shown that if the MWM is unique, then for t large enough, M is a 
matching and equal to the MWM. 



3 Isolation Lemma for Maximum- Weight Matchings and 
Min-Cost Flows 

Before we turn to proving the upper tail bounds for the number of iterations of the BP algorithm 
in Section we take a closer look at the quantity 6, which we defined above as the difference 
in weight or cost between the best and second-best matching or integer flow, respectively. The 
previous results discussed in Section 11.11 indicate that in order for the BP algorithm to be 
efficient 5 must not be too small. While 5 can be arbitrarily small for weights or costs that are 
chosen by an adversary, it is a well-known phenomenon that 5 is with high probability not too 
small when the weights or costs are drawn randomly. 



3.1 Maximum- Weight Matchings 

Beier and Vocking [5] have considered a general scenario in which an arbitrary set S C {0, l} m 
of feasible solutions is given and to every x = (x±, . . . , x m ) eSa weight w ■ x = w\X\ + . . . + 
w m x m is assigned by a linear objective function. As in our model they assume that every 
coefficient Wi is drawn independently according to an adversarial density function fi : [0, 1] — » 
[0, (j)] and they define 5 as the difference in weight between the best and the second-best feasible 
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solution from S, i.e., 5 = w-x* — w-x where x* = axgmax xeS w ■ x and x = argmax xeS \r x *-i w-x. 
They prove a strong isolation lemma that, regardless of the adversarial choices of S and the 
density functions fi, the probability of the event 5 < e is bounded from above by 2e<pm for 
any e > 0. 

If we choose S as the set of incidence vectors of all matchings or (perfect) 6-matchings 
in a given graph, Beier and Vocking's results yield for every e > an upper bound on the 
probability that the difference in weight 5 between the best and second-best matching or 
the best and second-best (perfect) 6-matching is at most e. Combined with the results in 
Section 11.11 this can immediately be used to obtain an upper tail bound on the number of 
iterations of the BP algorithm for these problems. 

3.2 Min-Cost Flows 

The situation for the min-cost flow problem is significantly more difficult because the set S of 
feasible integer flows cannot naturally be expressed with binary variables. If one introduce a 
variables for each edge corresponding to the flow on that edge, then S C {0, 1, 2, ... , u max } TO 
where u max = max eg E u e . Roglin and Vocking [9] have extended the isolation lemma to the 
setting of integer, instead of binary, vectors. However, their result is not strong enough for 
our purposes as it bounds the probability of the event 5 < e by ecj)m(u mSkX + l) 2 from above 
for any e > 0. As this bound depends on u max it would only lead to a pseudo-polynomial 
upper tail bound on the number of iterations of the BP algorithm when combined with the 
results of [6J. Our goal is, however, to obtain a polynomial tail bound that does not depend 
on the capacities. In the remainder of this section, we prove that the isolation lemma from [S] 
can be significantly strengthened when structural properties of the min-cost flow problem are 
exploited. 

In the following we consider the residual network for a flow /. For each edge in the 
original network that has less flow than its capacity Uij, we include an edge ey with capacity 
Uij — fij in the residual network. Similarly, for each edge e%j that has flow greater than zero, 
we include the backwards edge eji with capacity fij in the residual network. We refer to Ahuja 
et al. [lj for a more details about residual networks. 

As all capacities and budgets are integers, there is always a min-cost flow that is integral. 
An additional property of our probabilistic model is that with probability one there do not 
exist two different integer flows with exactly the same costs. This follows directly from the fact 
that all costs are continuous random variables. Hence, without loss of generality we restrict 
our presentation in the following to the situation that the min-cost flow is unique. 

In fact, Gamarnik et al. |6] have not used 5, the difference in cost between the best and 
second-best integer flow, to bound the number of iterations needed for BP to find the unique 
optimal solution of MCF, but they have used another quantity A. They have defined A as 
the length of the cheapest cycle in the residual network of the min-cost flow /*. Note that A 
is always non-negative. Otherwise, we could send one unit of flow along a cheapest cycle. 
This would result in a feasible integral flow with lower cost. With the same argument we can 
argue that A must be at least as large as 5 because sending one unit of flow along a cheapest 
cycle results in a feasible integral flow different from /* whose costs exceed the costs of /* by 
exactly A. Hence any lower bound for 5 is also a lower bound for A and so it suffices for our 
purposes to bound the probability of the event 5 < e from above. 

The isolation lemma we prove is based on ideas that Gamarnik et al. [6j Theorem 8.1] have 
developed to prove that the optimal solution of a min-cost flow problem is unique with high 
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probability if the costs are randomly drawn integers from a sufficiently large set. We provide 
a continuous counterpart of this lemma, where we bound the probability that the second-best 
integer flow is close in cost to the optimal integer flow. 

Lemma 3.1. The probability that the cost of the optimal and the second-best integer flow 
differs by at most e > is bounded from above by 2e0m. 

Proof. Consider any fixed edge e, and let the costs of all other edges be fixed by an adversary. 
The cost eg of e is drawn according to its probability distribution, whose density is bounded 
by 4>. 

For e € E, let E| be the event that there exist two different integer flows /* and / with the 
following properties: 

(i) /* is optimal. 

(ii) c • /* and c • / differ by at most e, i.e., c • / < c • /* + e. 

(iii) /* = and f e > 0. 

Let be analogously defined, except for Condition (fm|) being replaced by /* = u e and f e <u e . 

Claim 3.2. Let e E E be arbitrary. Assume that all costs except for c e are fixed. Let L C [0, 1] 
be the set of real numbers such that L = {c e | E|}. Then I is a subset of an interval of length 
at most e. 

Proof. If / ^ 0, let a = min(I) and let /* be an optimal integer flow for c e = a with /* = 0. 
Due to the choice of a it is clear that I C [a, oo) We claim that I Q [a,a + e]. If c e = a + r\ 
for some r/ > 0, then /* stays optimal, and, for any feasible integer solution / with f e > 0, we 
have 

C • / = ^ C e/e + (« + V)fe > ^ Cg/g + Oif e + 7] as f e > 1 

e^e e^e 

> c • /* + r) as /* = and /* is optimal. 

Thus, for r] > e, the event E| does not occur. □ 

The proof of the following claim is omitted as it is completely analogous to the proof of the 
previous claim. 

Claim 3.3. Let e € E be arbitrary. Assume that all costs except for c e are fixed. Let L C [0, 1] 
be the set of real numbers such that L = {c e | E e }. Then I is a subset of an interval of length 
at most e. 

The following claim shows that, provided no event EJ or occurs, the second-best integer 
flow is more expensive than the best integer flow by at least an amount of e. 

Claim 3.4. Assume that for every edge e G E neither E| nor E e £ occurs. Let f* be a min-cost 
flow and let f ^ f* be a min-cost integer flow that differs from f* , i.e., a second-best integer 
flow. Then c ■ f > c ■ f* + e. 



8 



Proof. First we prove that under our assumption that the min-cost flow is unique some edge 
e exists such that /* £ {0,u e } and f e ^ /*. Suppose that no such edge e exists and let 
d = f* — /. Then d e > only if /* < u e and d e < only if /* > because otherwise there is 
an edge e with /* G {0, ii e } and / e ^ f*. From this, we can conclude that there exists a A > 
such that /* + Ad is a feasible flow. Let Ao = max{A | /* + Xd is feasible} and / = /* + X^d. 
From the assumption that the min-cost flow is unique it follows that c ■ d = c ■ f* — c ■ f < 0. 
Hence, c • / < c • /*, contradicting the choice of f* as min-cost flow. 

This argument shows that there always exists an edge e such that /* £ {0, u e } and f e ^ f*. 
As none of the events EJ and E e occurs for this edge e, it follows that c • / > c • /* + e. □ 

From Claims 1331 and 1331 we obtain P(EJ) < e<j) and P(E^) < ecf): We fix all edge costs except 
for c e and then EJ can only occur if c e falls into an interval of length at most e. Since the 
density function of c e is bounded from above by </>, this happens with a probability of at most 
E(f). The same holds for any E £ . Thus, the lemma follows by a union bound over all 2m events 
EJandE". " □ 

The isolation lemma (Lemma l3.1|) together with the discussion about the relation between 5, 
the difference in cost between the best and second-best integer flow, and A, the length of the 
cheapest cycle in the residual network of the min-cost flow /*, immediately imply the following 
upper bound for the probability that A is small. 

Corollary 3.5. For any e > 0, we have P(A < e) < 2ecj)m. 



4 Upper Tail Bounds 

4.1 Maximum- Weight Matching 

We first consider the BP algorithm of Bayati et al. [3], which computes maximum- weight 
matchings in complete bipartite graphs G in 0{nw* /5) iterations on all instances with a unique 
optimum. Here w* denotes the weight of the heaviest edge and 5 denotes the difference in 
weight between the best and the second-best matching. Even though it is assumed that G is 
a complete bipartite graph, this is not strictly necessary. If a non-complete graph is given, 
missing edges can just be interpreted as edges of weight 0. 

With Beier and Vocking's isolation lemma (cf. Section l3~T|) we obtain the following tail bound 
for the number of iterations needed until convergence when computing maximum-weight perfect 
matchings in bipartite graphs using BP. 

Theorem 4.1. Let r be the number of iterations until Bayati et al. 's BP ^ for maximum- 
weight perfect bipartite matching converges. Then P(r > t) = 0(nm(j)/t). 

Proof. The number of iterations until BP converges is bounded from above by 0{nw* /5) [3]. 
The weight of each edge is at most 1, so w* < 1. The upper bound exceeds t only if 5 < 0(n/t). 
By Beier and Vocking's isolation lemma, we have ¥(8 < 0(n/t)) < 0(nm(j)/t), which yields 
the bound claimed. □ 

This tail bound is not strong enough to yield any bound on the expected running-time of 
BP for bipartite matchings. But it is strong enough to show that BP has smoothed polynomial 
running-time with respect to the relaxed definition adapted from average-case complexity [5], 
where it is required that the expectation of the running-time to some power a > is at most 
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linear. However, a bound on the expected number of iterations is impossible, and the tail 
bound proved above is tight up to a factor of 0(m) (Section [5]). 

As discussed in Section fl.il BP has also been applied to finding maximum- weight (perfect) 
6-matchings in arbitrary graphs [2 llllj . The result is basically that BP converges to the optimal 
matching if the optimal solution of the relaxation of the corresponding linear program is unique 
and integer. The number of iterations needed until convergence depends again on "how unique" 
the optimal solution is. For Bayati et al.'s variant [2], the number of iterations until convergence 
depends on 1/5, where 5 is again the difference in weight between the best and the second- 
best matching. For Sanghavi et al.'s variant [TT], the number of iterations until convergence 
depends on 1/c, where c is the smallest rate by which the objective value will decrease if we 
move away from the optimum solution. 

However, the technical problem in transferring the upper bound for bipartite graphs to 
arbitrary graphs is that the adversary can achieve that, with high probability or even with 
a probability of 1 (for larger </>), the optimal solution of the LP relaxation is not integral. 
Already in the average-case, i.e., for <p = 1, where the adversary has no power at all, the 
optimal solution of the LP relaxation has some fractional variables with high probability. 

Still, we can transfer the results for bipartite matching to both algorithms for arbitrary 
matching if we restrict the input graphs to be bipartite, since in this case the constraint 
matrix of the associated LP is totally unimodular. 

Theorem 4.2. Let r be the number of iterations until Bayati et al. 's [2] or Sanghavi et 
al. 's [11] BP for general matching, restricted to bipartite graphs as input, converges. Then 
P( r > t) = 0(nm(j)/t). 

Proof. For Bayati et al.'s BP algorithm, this follows in the same way as Theorem 14.11 from 
their bound on the number of iterations until convergence, which is 0(n/5) [2j Theorem 1]. 

Sanghavi et al. prove that their variant of BP for general graphs converges after 0(1 /c) 
iterations, provided that the LP relaxation has no fractional optimal solutions. Here, c is 
defined as 

w ■ (x* — x) 

c = mm — — | — , 

i^i* is a vertex of P \\X* — X h 

where x* is the (unique) optimal solution to the relaxation and P is the matching polytope |1R 
Remark 2]. 

For any x ^ x* , we have ||a;* — x||i < n. Furthermore, w ■ (x* — x) is just the difference in 
weights between x* and x. Since the input graph is bipartite, all vertices of P are integral. 
Thus, w ■ (x* — x) > S, where (again) 5 is the difference in weight between the best and the 
second-best matching. Thus, c > S/n, which proves the theorem. □ 

Remark 4.3. Bayati et al. [2] and Sanghavi et al. fllf have also shown how to compute b- 
matchings with BP. Ifbis even, then the unique optimum to the LP relaxation is integral. Thus, 
we circumvent the problem that the optimal solution might be fractional. Hence, following the 
same reasoning as above, the probability that BP for b-matching for even b runs for more than t 
iterations until convergence is also bounded by 0{mn(j)/t). 

Furthermore, Bayati et al. JH Section 4] have shown how to compute the optimal solution 
of the relaxation of the matching LP with graph covers. They obtain the same 0(n/5) bound 
for the number of iterations until convergence as for ordinary matching. However, since we 
are no longer talking about integer solutions, we cannot directly apply the isolation lemma of 
Beier and Vocking J3J/. To see that 5 is still unlikely to be small in the same way (with a 
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slightly worse constant), we can apply the isolation lemma of Roglin and Vocking 19] since the 
matching polytope is half-integral. Thus, if we scale the right-hand side with a factor of 2, then 
we obtain a 0/1/2 integer program. Because of this, we obtain the same 0(mn(j)/t) tail bound 
for the probability that the number of iterations until convergence exceeds t. 

4.2 Min-Cost Flow 

The bound for the probability that A is small (Corollary 13. 5p plus the pseudo-polynomial 
bound by Gamarnik et al. [6] yields a tail bound for the number of iterations that BP needs 
until convergence. 

Theorem 4.4. Let r be the number of iterations until BP for min-cost flow fd^ converges. 
Then P(r > t) = 0{n 2 m<j)/t). 

Proof. The number of iterations until BP for min-cost flow converges is bounded from above 
by cLn/A for some constant c, where L is the maximum cost of a simple directed path in 
the residual network for the optimal flow [6, Theorem 4.1]. The cost of each edge is at most 
1, so L < n. The upper bound exceeds t only if A < en 2 jt. By Corollary 13.51 we have 
P(A < en 2 /t) < 2cn 2 m<j)/t, which yields the bound claimed. □ 

5 Lower Tail Bounds 

We show that the expected number of iterations necessary for convergence of BP for maximum- 
weight matching (MWM) is unbounded. To do this, we prove a lower tail bound on the number 
of iterations that matches the upper tail bound from Section [H The lower bound holds even for 
a two by two complete bipartite graph with edge weights drawn independently and uniformly 
from the interval [0,1]. In the following analysis, we consider the BP variant introduced by 
Bayati et al [3]. Our results can be extended to other versions of BP for matching and min-cost 
flow [21EHH] in a straightforward way. We discuss these extensions in Section 15.41 

We first discuss the average case, i.e., <f> = 1, for which we obtain a lower tail bound of Vt(n/t) 
for the probability that more than t iterations are needed for convergence (Section l5.2p . For this 
lower bound, we use a simple adversarial graph. We leave it as an open problem whether the 
lower bound also holds for the complete bipartite graph on n vertices. After that, we consider 
complete bipartite graphs with smoothed weights and prove a lower bound of Q(n<p/t) for the 
probability that more than t iterations are needed for convergence (Section 15. 3p . We conclude 
this section with a discussion on how to transfer our results to the other variants of BP for 
matching and min-cost flow. 

5.1 Computation Tree 

For proving the lower bounds, we need the notion of a computation tree, which we define 
analogously to Bayati et al. [3J. 

Let G = (U U V, E) be a bipartite graph with U = {u\, . . . , u n } and V = {t>i, . . . , v n }. We 
denote the level-k computation tree with the root labeled x € ?7U V by T k (x). The tree T k (x) 
is a weighted rooted tree of height k + 1. The root node in T°(x) has label x, its degree is the 
degree of x in G, and its children are labeled with the adjacent nodes of x in G. T k+1 (x) is 
obtained recursively from T k (x) by attaching children to every leaf node in T k {x). Each child 
of a former leaf node labeled y is assigned one vertex adjacent to y in G as a label, but the 
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label of the former leaf node's parent is not used. (Thus, the number of children is the degree 
of y minus 1.) Edges between nodes with label Ui and label Vj in the computation tree have a 
weight of Wij . 

We call a collection A of edges in the computation tree T k (x) a T -matching if no two edges 
of A are adjacent in T k (x) and each non-leaf node of T k (x) is the endpoint of exactly one edge 
from A. Leaves can be the endpoint of either one or zero edges from A. Let t k {ui;r) be the 
weight of a maximum weight T-matching in T k (ui) that uses the edge (ui,v r ) at the root. 



5.2 Average-Case Analysis 

Consider the undirected weighted complete bipartite graph K 2>2 = (U U V, E), where U = 
{til, 1*2}, V = {^1,^2}, and (u{,Vj) G E for 1 < i, j < 2. Each edge (m,Vj) = eij has weight 
drawn independently and uniformly from [0, 1]. We define the event E £ for < e < g as the 
event that wu G [|, l] , w\ 2 G (5, |] , 1021 G (|, §] , and w 22 G [wu +w 2 ± —W\\ -e,wi 2 + w 2 i - 
w\i). Consider the two possible matchings M\ = {en, 622} and M 2 = {ei 2 ,e 2 i}. If event E £ 
occurs, then the weight of M 2 is greater than the weight of Mi and the weight difference is 
at most e. In addition, w\\ is greater than w\ 2 and the weight difference is at least 1/4. See 
Figure [1] for a graphical illustration. 



w lx e [|,i] 

Ui ^- Vl 

wi 2 G(itr^. (|,|] 



u 2 m^- 



v 2 



W22 G \w\2 + W21 - Wn - e, wu + W21 - wu, 



Figure 1: If event E e occurs, then the weight of the dashed matching M 2 = {e\ 2 , ^21} is greater 
than the weight of the solid matching M\ = {en, 622} and the weight difference is at 
most e. In addition wn is greater than 1012 and the weight difference is at least 7. 



Lemma 5.1. The probability of event E £ is e/8 3 . 



Proof. The intervals in which w\\, w\ 2 , and w 2 \ have to assume values in order for event E £ 
to occur all have a length of 1/8. The interval in which ^22 has to take a value in order for 



event E £ to occur, has a length of e. It is contained completely in the interval (0, \ I , since 



1 5 n 1 „ 

W12 + w 2 i - w n - e > - + - - 1 - - = 

Z o o 

and 

5 3 7 1 
wu + w 2l -w u < - + --- = -. 

Now the probability that W\\, w\ 2 , w 2 i, and ^22 all take values in the interval necessary for 
event E £ to occur is e/8 3 . □ 

Lemma 5.2. If event E £ occurs, then the belief of node u\ of K 2i2 at the end of the 4k-th 
iteration is incorrect for all integers k < ^ — 1. 
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Figure 2: The computation tree T 4fc (ui). 



Proof. Consider the computation tree T 4k (u\ ) (see Figure [2]) . According to Bayati et al. [U 
Lemma 1], the belief of node u\ of i^2,2 after 4k iterations is given by the two-dimensional 
vector fr 4 * = [2t 4fc (ui;l) 2t Ak {u\; 2)] *. This means that, after 4k iterations, the belief of 
node ui that it should be matched to v\ is equal to twice the weight of the maximum-weight 
T-matching of T Ak (ui) that selects edge (ui,vi) at the root. Analogously, after 4k iterations, 
the belief of node U\ that it should be matched to v-i is equal to twice the weight of the 
maximum- weight T-matching of T Ak (ui) that selects edge (ui, 1)2) at the root. The maximum- 
weight T-matching A that matches the root node to its child labeled V2, matches each node 
labeled ui to a node labeled V2 and each node labeled u-i to a node labeled vi, since this is the 
only possible T-matching that matches the root node to its child labeled x>2- Define A* as the 
T-matching that matches each node labeled u\ to a node labeled v\ and each node labeled 112 
to a node labeled V2- We show that A* has larger weight than A, which implies that the belief 
at node ui after 4k iterations is incorrect. We have 



Theorem 5.3. The probability that BP for MWM needs at least t iterations to converge for 
i^2,2 with edge weights drawn independently and uniformly from [0, 1] is at least — for some 
constant c > 0. 

Proof. We denote the number of iterations necessary for convergence of BP for MWM by r. 
Using Lemma 15.11 and Lemma 15.21 we have 



w(A*) - w(A) = (2k + l)iflu + 2kw 2 2 - (2k + l)wi 2 - 2kw 2 i 
= 2k(wn + W22 - wu - W21) + wn - W12 
> -2fee + l/4. 

Now —2ke + 1/4 is greater than zero if k < ^ — 1. 



□ 




1 1 



8 4 (^/4] +1) - ct 
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for some constant c > 0. 



□ 



Corollary 5.4. There exist bipartite graphs on n > 4 nodes, where n is a multiple of 4, with 
edge weights drawn independently and uniformly from [0, 1], for which the probability that BP 
for MWM needs at least t iterations to converge is 0( j) for t > n/c' for some constant c' > 0. 



Proof. The bipartite graph consists of n/4 copies of K22 and there are no edges between nodes 
in different copies of -^2,2 • If BP does not converge in less than t iterations for at least one of 
the n/4 copies of -^2,2 5 then BP does not converge in less than t iterations. This holds since a 
run of BP on this bipartite graph corresponds to n/4 parallel runs of BP on the n/4 copies of 
K2 2 • Using Theorem 15,31 we have that a constant c > exists such that 



(r < t) = (1 — F(BP needs at least t iterations for a particular copy of ^2,2)) 



n/4 



/ 1\" /4 / n\ n 

where the second inequality follows from 1 — x < exp(-x) and the last inequality follows from 
exp(— x) < 1 - § for x € [0, 1] and from ^ < 1 which holds if t > ^. □ 

5.3 Smoothed Analysis 

In this section we consider complete bipartite graphs K nn in the smoothed setting. We denote 
by X ~ U[a,b] that random variable X is uniformly distributed on interval [a, b]. In the fol- 
lowing we assume that > 26 and n > 2 and even. Similarly to the average case (Section I5.2|) 
we define the event Ef for K%p. and for < e < l/cf> as the event that wn € [l — ^,l] 
^12 G (lt> It + |] ' w 2l G (H, H + |] , and w 22 G [^12 + ^21 - Wu ~ e, ^12 + w 2 i - Wu) 
Consider the two possible matchings Mi = {en, 622} and M2 = {ei2,e2i}. If event occurs 
then the weight of M2 is greater than the weight of M\ and the weight difference is at most e 
In addition w\\ is greater than w\2 and the weight difference is at least ^ — §. 

Lemma 5.5. There exist probability distributions on [0,1] /or i/te weights of the edges, whose 
densities are bounded by 4>, such that the probability of event Ef is at least £0/4. 

Proof The intervals in which wn, W12, and W21 have to assume values in order for event e£ 
to occur all have a length of ^. We choose the corresponding probability distributions such 
that they have density <j) on the corresponding interval and density elsewhere. The interval 
in which W22 has to assume a value in order for event E% to occur has a length of e. It is 
contained completely in the interval [|| — ^, || + | ] , since 

23 23 1 20 1 

W\2 + W01 — Wv\ — £ > 1 1 = 7 

26 26 26 (f) 

and 

/23 1\ /23 1\ / 1\ 20 3 
^12 + ^21 -«, u <^_ + -j + ^_ + -j-^l--j=_ + -. 

We choose the probability distribution for 1^22 such that it has density £ on the interval 
[1§ — 3' 1§ + f ] an d elsewhere. Now the probability that u>n, wu, and W21 take values in the 
interval necessary for event e£ to occur is 1. For W22, this probability is £0/4. This completes 
the proof of Lemma 15.51 □ 
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Lemma 5.6. If event E$ occurs, then the belief of node u\ at the end of the Ak-th iteration is 
incorrect for all integers k < ^ — 1 . 

Proof. As in Lemma 15, 2\ a maximum- weight T-matching that selects the edge labeled (ui,v{) 
at the root has greater weight than a maximum-weight T-matching that selects the edge labeled 
(ui,V2) at the root for these values of k. □ 

Analogously to the proof of Theorem 15.31 Lemmas 15.51 and 15.61 above immediately yield a 
lower bound of O(0/i) for the probability that BP runs for at least t iterations. 

Our goal in the remainder of this section is to prove an Q(n(f)/t) lower bound for the complete 
bipartite graph. Thus, let us consider the complete bipartite graph K n>n = (U U V, E) with 
U = {4\pe {1,2}, j e {l,...,n/2}} and V = {v J q | q G {1,2}, j G {l,...,n/2}. Let W 
denote the subgraph induced by {u\, u\, v\, v^} for j G {1, . . . , n/2}. The role of the subgraphs 
H 3 is the same as the role of the copies of i^2,2 in the proof of Corollary 15.41 Let ej, q be the 
edge connecting u J p and v\ (p, q G {1, 2}, j G {1, . . . , n/2}). The weight of this edge is u? vq . We 
draw edge weights according to the probability distributions 



(I'- 



ll 



(/'- 



21 



U 



W a b ~ U 



1 



, . . 23 23 1 

U — , h - 

26 26 6 



0, 



1 



(('■ 



12 



! '22 



,23 23 1 
1 26' 26 + 6 



20 1 20 3 

26 ~ 6' 26 + 6 



(1) 



if u a G H j and v b G H k with j ^ k. 



We call the edges between nodes in the same induced subgraph H J heavy edges. Edges between 
nodes in different subgraphs H 3 and H k we call light edges. By assumption, we have (j) ^ 26. 
Thus, the weight of any light edge is at most 1/26, while every heavy edge weighs at least 
19/26. 

In contrast to the proof of Corollary 15. A\ we now have to make sure that light edges are not 
used in any computation tree. This allows us to prove the lower bound in a similar way as 
Theorem 15.31 and Corollary 15.41 



Lemma 5.7. Let A* be the maximum-weight T-matching on the computation tree T k (ui). 
Then A* does not contain any light edges. 

Proof. Assume to the contrary that A* contains a light edge (x,y). In that case x and y are 
in different subgraphs. The idea of the proof is to construct a path P from one leaf of the 
computation tree to another leaf that includes edge (x, y). Path P alternately consists of edges 
that are in A* and edges that are not. We show that a new T-matching of greater weight can 
be constructed by removing from A* the edges in P n A* and adding the edges in P\A*. 

We include the edge labeled (x, y) in P and extend P on both sides: We start with node 
zq = x and node zq = y, respectively, and construct the corresponding part of P as follows: 



1. for i = 1, 3, 5, . . . do 

2. if Zi—i is a leaf node then terminate. 

3. Let H k be the subgraph that Zi-\ belongs to. 
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4. Let ej = (zi-\,Zi) be the edge incident to Zi-\ that belongs to the optimal matching 
with respect to H k . 

5. Add a to P. 

6. if Zi is a leaf node then terminate. 

7. Let ej + i = be the (unique) edge incident to Zi that belongs to A*. 

8. Add e i+1 to P. 

It is clear that the procedure can only terminate if it finds a leaf. Moreover, the constructed 
sequence is alternating. Now we can show that no node will be visited twice: Otherwise there 
was an index i such that Zi—\ = Zi+i since we are moving in a tree. This can not happen, 
however, as the sequence is alternating. Hence, the procedure terminates. With the previous 
properties we also obtain that both paths constructed starting with zq = x and zq = y, 
respectively, are disjoint since z\ £ {x, y} in both cases. Consequently, we obtain one simple 
path P connecting two distinct leaf nodes and containing edge (x,y). 

We now show that the weight of the edges in P \ A* is strictly larger than the weight of the 
edges in P n A*. For this, let P be of the form P = (po, . . . ,pi), where i is even and where 
(PO)Pl) S A*. Let / C {1, ...,£} be the set of indices i for which (pi-i,Pi) is a light edge. 
Clearly, (pj_i,pj) € A* for each i € I by construction (see LineH]). Since the light edge (x,y) 
belongs to P we have 1^0. For i G I let Pi = (pi-i,Pi,Pi+i) be the subpath of P of length 2 
starting at node As (pi,p i+ i) is a heavy edge, w PiPi+1 - w Pi _ lPi > (§§ - |) - | = || - |. 
Hence, the difference in weight between the edge of Pi that belongs to A* and the other one is 
significant. 

Now remove all paths Pi from P and consider the subpaths of P (connected components) 
that remain. There are at most |/| + 1 such subpaths P', each has even length, and they 
only consist of heavy edges, i.e., all their edges lie in one subgraph H k where k depends on P' . 
Consider such a subpath P' and partition it into subpaths Pj of length 4 and, if the length of P' 
is no multiple of 4, into one subpath P of length 2. The A*-edges of Pj form the non-optimal 
matching on H k , whereas the other two edges form the optimal matching on H k . Hence, the 
total weight of Pj n A* is at most the total weight of Pj \ A*. Only for P we might have the 
case that the weight of P n A* is larger than the weight of P \ A*, but since both edges are 
heavy, the difference is at most 1 — (|g — -A = + -r. Hence, the difference between the total 
weight of P \ A* and the total weight of P n A* is at least 

lr . f20 2\ . /6 1\ lTl fU 3\ /6 1\ 4 

l / l'(26-«)-( |/| + 1 ''(26 + «)= |/| 'U-^-(26 + *)-26 > ° 

since |/| > 1 and <fi > 26. 

We can now construct a T-matching with heavier weight than A* by removing the edges in 
P H A* from A* and adding the edges in P \ A*. This contradicts the assumption that the 
maximum weight T-matching includes a light edge and proves the lemma. □ 

Theorem 5.8. There exist probability distributions on [0, 1] for the weights of the edges, whose 
densities are bounded by cf), such that the probability that BP for MWM needs at least t iterations 
to converge for K„ ;n is Q{n<j)/t) for t > nfyjc for some constant c > 0. 
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Proof. We choose the probability distributions for the edge weights according to (pQ). Let 
e = 52 (k+i) ^ or ^ = ^T^/^l an< ^ assume that event Ef occurs for subgraph H 3 , In this case the 
weight of matching M2 = {e J 12 ,e 21 } ^ s larger than matching M\ = {e^e^}, but at most by 
the small amount of e. Consider the computation tree T ik (u\). As in the proof of Lemma 15,2 
we know that if the maximum weight T- matching A* on T 4fc (itj) does not include the edge 
labeled e\ 2 at the root, then BP has not yet converged within the first 4k > t iterations (see 
Bayati et al. [4s Lemma 1]). 

We show that A* does not include edge e\ 2 . Assume to the contrary that it does. We know 
from Lemma 15.71 that A* does not contain light edges. Now we use the same procedure to 
create a path P from one leaf of T 4fc [u\ ) to another leaf that contains edge e\ 2 and alternates 
between edges from A* and edges from T ik {u\) \ A*. Since T Ak (u\) has height 4k + 1 and since 
u\ is the root of T ik {u\), path P contains exactly 8k + 2 edges, 2k + 1 of which are edges e{ 2 , 
Ik + 1 of which are edges e{ 1; 2k of which are edges e 22 , and 2k of which are edges e 21 . The 
ed ges ej 2 and e 2 ^ are exactly the edges of P fl A*. As in Lemma [521 the difference of weight 
between edges from P \ A* and P n A* is at least 

,■ .■ , // 1\ /23 1\\ 2k 
w n ~ w i2 ~ 2k£ > I I 1 - - I - I — + 



V26 4>)) 52(fe + l) 



3 2 1 

> >0 

26 4> 26 ~ 

since <fi > 26. This contradicts the fact that A* is optimal since removing from A* the edges 
in P fl A* and adding the edges in P \ A* yields a T-matching of heavier weight for T 4fc (ttj). 

We have shown that BP does not converge within the first t iterations if event Ef occurs 
for some subgraph H 3 . Since there are n/2 such subgraphs, we get that the probability that 
BP for MWM needs at least t iterations to converge for K n>n is fi( ^ ) since 

IP(' r < t) <F (Ef does not occur for any subgraph H 3 \ 

, A e(f)\ n/2 ( en<f)\ ( ncf) 

< I 1 —J < exp — = exp 



< 1 



4 J ~ " V 8 J V 8 • 52 • (4 • ft/41 + 1) 

n<j) 



2 • 8 • 52 • (4 • [t/4] + 1) 



where the second inequality follows from Lemma 15.51 The third inequality is due to the fact 
that 1 — x < exp(— x), whereas the last inequality stems from exp(— x) < 1 — f for x € [0, 1]. 
If x = 3.52.(4"t/4]+i) ^ s a ^ mos t 1, which holds for t > g^g, then the correctness follows. □ 

Note that the lower bound on the probability that BP for MWM converges within t itera- 
tions only differs a factor 0(m) from the upper bound from Section f4.il 



5.4 Other Variants of Belief Propagation 

The results of Sections 15.21 and 15.31 also hold for other versions of belief propagation for 
minimum/maximum- weight (perfect) 6-matching and min-cost flow [2JE1III] applied to the 
matching problem on bipartite graphs. The difference in the number of iterations until conver- 
gence differs no more than a constant factor. We omit the technical details but provide some 
comments on how the proofs need to be adjusted. 
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Some of the versions of BP consider minimum- weight perfect matching [2\ or min-cost flow [6] 
instead of maximum-weight perfect matching. For these versions we get the same results if we 
have edge weights w e = 1 — w e for all edges e. 

For some of the versions of BP [S1[TT] the root of the computation tree is an edge instead of 
a node . If we choose the root of this tree suitably, then we have that the difference in weight 
between the two matchings Mi and Mi of at most e not only has to 'compensate' the weight 
difference Au>(ei,e2) between an edge e\ in M\ and an edge e2 in M2, but the entire weight 
w e of an edge e in Mi or M%. However, the probability distributions for the edge weights in 
Sections 15.21 and 15.31 are chosen such that Au;(ei, 62) and w e do not differ more than a constant 
factor. 
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